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The existing Dixon’s test (DT) under classical statistics has been widely applied in a variety of fields. The 
main target of DT is to recognize the outlier or suspicious observation in the sample. The DT available in 
the literature is workable when all the observations in the sample or the population are precise, deter- 
mined and certain. In practice, under the complex system, it may not possible that all observations in 
the data are determined. In such situations, the existing DT cannot be applied for the detection of the out- 
lier value in the sample. In this paper, we will introduce a new Dixon’s test under the neutrosophic statis- 
tics is called (NDT). We will present the testing procedure for the proposed test using the neutrosophic 
statistical interval method. We will discuss its application with the help of an example. 
© 2020 The Author(s). Published by Elsevier B.V. on behalf of King Saud University. This is an open access 
article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). 





1. Introduction 


The estimation and forecasting are done using the data col- 
lected from the respective fields of interest. The statistical tech- 
niques/methods are applied for the data analysis. The collection 
of data from the fields is not an easy task. A lot of money, time 
and effort are needed to collect data from the fields of interest. 
For example, the collection of the ocean data is not an easy task 
with the limited resources, see for example, (Jingang et al., 2017). 
The presence of the potential outliers in the data may mislead 
the policymakers. In the statistics, an observation that is away 
from the other observations or has many variations in the sample 
is known as the outlier. For an accurate analysis of the data, the 
extreme values should be identified and eliminated from the sam- 
ple. The elimination of the extreme values of the data is an impor- 
tant field of data mining. According to (Manoj and Senthamarai 
Kannan, 2013), “The identification of outliers can lead to unex- 
pected knowledge discovery in areas such as credit card fraud 
detection, criminal behaviors detection, computer intrusion detec- 
tion, calling card fraud detection, etc. Applications such as outlier 
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detection customized marketing, network intrusion detection, 
weather prediction; pharmaceutical research and exploration in 
science databases require the detection of outliers”. (Dixon, 
1950; Dixon, 1951) proposed the test for the identification of the 
extreme value in the data. Several authors paid their attention to 
this issue and presented valuable work in the detection of the out- 
lier using the Dixon’s test, see for example, (Verma and Ruiz, 2006; 
Böhrer, 2008; Manoj and Senthamarai Kannan, 2013; Jabbari 
Nooghabi, 2017; Jingang et al., 2017; Jabbari Nooghabi, 2019). 

In practice, for the variable data, it may not always possible to 
have the exact observations and attribute data may have the cate- 
gories. In these situations, data analysis may be overestimated or 
underestimated. The fuzzy logic, which is the alternative to be 
applied for the analysis of fuzzy data. Therefore, data mining using 
fuzzy logic helps to eliminate the outlier from the imprecise data. 
(Rajeswari et al, 2014) discussed the fuzzy-based approach method 
to detect the outlier in educational data. More details about the 
detection of outlier in fuzzy data can be seen in (Hung and Yang, 
2006; Gładysz and Kuchta, 2007; Cateni et al., 2007; Yousri et al., 
2007; Top et al., 2009; Ramli et al., 2010; D'Errico and Murru, 
2012; Kim et al., 2014; Duraj et al., 2019). 

The neutrosophic logic which is the generalization of the fuzzy 
logic is introduced by (Smarandache, 1998). The neutrosophic 
deals with the measure of the indeterminacy. The neutrosophic 
statistics, which is the extension of classical statistics, used when 
there is Neutrosophy in sample or data. The neutrosophic statistics 
is applied when the observations in the sample or the population 
are indeterminate, vogues and imprecise. The neutrosophic statis- 
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tics is quite effective and adequate to be applied in the uncertainty 
system, see for example, (Smarandache, 2014; Aslam, 2018b). Chen 
et al. (2017a) and Chen et al. (2017b) presented the application of 
the neutrosophic numbers in the area of rock engineering. Aslam 
(2018b) and Aslam (2019) introduced neutrosophic statistical 
quality control (NSQC). 

As we mentioned earlier, the presence of the outlier can affect 
the forecasting and estimation of the data. Several authors con- 
tributed much work for the detection of the outlier using Dixon’s 
test under classical statistics and using fuzzy logic. By exploring 
the literature and according to the best of our knowledge, there 
is no work on Dixon’s test under the neutrosophic statistics. In this 
paper, we will originally propose Dixon’s test under the neutro- 
sophic statistics for the identification of the outlier in the data or 
sample. We will develop Dixon’s test under the neutrosophic 
statistics and explain with the help of an example from the Sam- 
sung Company. We expect that the proposed Dixon’s test under 
the neutrosophic statistics will be more effective and adequate to 
be applied under an uncertainty environment. 


2. Preliminaries 


Suppose that Xny = Xn + uly; Xnn€[Xn,Xnu] be a neutrosophic 
random variable having the determinate part X, and indeterminate 
part uly; Iye[inf, sup]. Note here that the neutrosophic random vari- 
able becomes the random variable under classical statistics when 
Iye[O, sup]. We also assume that Xnpve[Xn,Xnu] is generated from 
the neutrosophic normal distribution with mean 
Uy = H + Uly; HnE| H, Uy] and neutrosophic standard deviation 
On = O + Uly;OnE|OL, Ou]. The neutrosophic mean and standard 
deviation reduce to mean under classical statistics when 
Xn. = Xnu. More details about the neutrosophic distributions can 
be seen in Smarandache (2010) and Aslam (2018a). 





3. Dixon’s test under NSIM 


Let Xnn = Xn + uly be neutrosophic random variable consists of 
a determined part X, and indeterminate part uly and Iye[I;, Iy] is an 
indeterminacy interval. We assumed that Xnne€[Xni, Xnu] is selected 
from the neutrosophic normal distribution. The purpose of the pro- 
posed Dixon’s test under the neutrosophic statistics is called (NDT) 
is to detect either the extreme value belongs to the group of obser- 
vations or not. Like the existing Dixon’s test under CS, we have the 
following assumptions for the proposed NDT 


3.1. Assumptions 


1. The neutrosophic variable Xnye[Xn, Xnu] is measured from the 
complex system having fuzzy, interval-based and imprecise 
values. 

2. The neutrosophic variable Xnpy€[Xn1,Xnu]} follows the neutro- 
sophic normal distribution. 

3. The size of the neutrosophic variable Xpy€[Xnt,Xnu] should be 
more than [3, 3]. 


3.2. Methods 


In this section, we discuss the testing procedure of the proposed 
Dixon’s test under NSIM is stated. For the testing, the largest or the 
smallest sample observations to the outlier, identify that value and 
put in front. Select a random sample of size nye[n,, ny] from the 
neutrosophic normal distribution and arranged it in ascending 
order or descending order. Let the neutrosophic sample 
Xnn€[Xnr,Xnu] has the values Xin = Xin + Uly, 
Xon = Xn + Uly,.. .Xnn = Xnn + uly. We arrange these neutrosophic 


observations in ascending order as Xiy < Xən < -++ < Xny, where 
Xin and Xn, are the smallest and the largest values of the neutro- 
sophic data. The proposed Dixon’s test statistic under NSIM is 
given by 

















Rion = Ve Ryon€[Riox, Riou], Xny €[Xnts Xnu a 
Ruin = o RiinE[Ri, Riu), Xny ElXn, Xnu (2 
Rain = ane), Roiw€[Raiz, Roiu],Xny €[Xnt,Xnu (3 
Ryn = Cr Tac, Rozn€[Roa1, Ro2u],Xny €[Xnt, Xnu (4 


By following (Dixon, 1950; Dixon, 1951) suggestions, we clas- 
sify the proposed test statistic with respect to the suitable values 
of nye[n,, ny] as follows 


(Xay -Xny-1) 


Rion = n Xn) O] < ny < [7,7] 
= (Xny Xn -1) 7 
Rin = Kan Xon) [8,8] < ny < [10,10] ; RR R] 
Ray = Cw) 194,11) < n < 013,13) 8 EE 
21N = T Xan) S ’ ] N [ , ] 


la, 
Ron = (ny Xan) ; [14,14] < ny < [30,30] 


(5) 


The neutrosophic form of the proposed test can be written as 
Ry = An + Byly; Ive[l, Iu] (6) 


where Ay and Byly are the determinate and indeterminate parts of 
the proposed test. Note here that the proposed Dixon’s test statistic 
under the neutrosophic statistics is the generalization of the (Dixon, 
1950; Dixon, 1951) test statistic under classical statistics. The pro- 
posed Dixon’s test statistic reduces to (Dixon, 1950; Dixon, 1951) 
test statistic if Xn, = Xny or I, = 0. More details on the existing statis- 
tics can be seen in (Kanji, 2006). To test whether the suspected 


observation is an outlier or not, the neutrosophicR®™, which is 
based on the ascending order of observations, Xn; €[Xnz, XnuJis com- 
puted. According to the test, an observation is declared to be an out- 


lier if the calculated value of RẸ™ exceeds the critical value at a 
specified level of significance a. 


4. Example 


In this section, we discuss the application of the proposed Dix- 
on’s test statistic in the area of neutrosophic statistical quality con- 
trol (NSQC). The Samsung Company is manufacturing several 
electronic equipment and mobile phones. This company is very 
popular in Saudi Arabia due to the high reliability of their product. 
The Samsung Company is interested to apply the proposed Dixon’s 
test statistic on cell phone batteries failure time. For the inspection 
of the batteries product, the experimenters at the company have 
selected a sample at regular interval during the manufacturing 
process. Due to the complexity of the lifetime measuring process, 
on several processes, the measurements are not accurate, precise 
and sometimes in an interval. Therefore, there is Neutrosophy in 
the failure time of the batteries product. Therefore, for the testing 
of the outlier in the failure time, the existing Dixon’s test statistic 
under classical statistics cannot be applied. The failure time of 12 
batteries having neutrosophic observations is shown in Table 1. 
The company is expected to have failed time greater than 
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Table 1 

The neutrosophic failure time. 
[973,975] [1214,1220] [1668,1668] [1903,1903] 
[1056,1056] [1449,1451] [1798,1800] [1921,1925] 
[1207,1210] [1566,1570] [1867,1870] [1946,1950] 





1000 h. From the data, they note that one failure time is from 973 
to 975. The company is interested to test it either it is an outlier or 
not in the sample. Therefore, we apply the proposed Dixon’s test 
statistic to test the null hypothesis that Xın€[973, 975] is an outlier 
Vs the alternative hypothesis that it is not an outlier and belong to 
the same group. There are 12 observations in the sample, therefore, 


we will use Rg" [R™, RS” | based on Roiye€[Ro1z, Roi]. The null 


hypothesis will be rejected if the value of RY €[R™, RE” | based 


on Roiw€[Raiz, Roiv] will be greater than critical. Some necessary cal- 
culations for this data are shown as follows. 

Step-1: Use the neutrosophic Kolmogrov-Smirnov (NK-S) test to 
see either the available data follows the neutrosophic normal dis- 
tribution or not. The neutrosophic P-value of Xnn€[Xn,Xnu] is [0.79, 
0.80] which indicates that given lifetime data follows the neutro- 
sophic normal distribution. 

Step-2: arrange the batteries failure time ny € [12, 12] in 
ascending order, say [973,975] <[1056, 1056] < --- < [1946, 1950] 
as in Table 1. LetX12n€[973,975], Xion€[1207, 1210] and X2ne[1921, 
1925]. 


Step-3: Compute R = axo) — rae79- 12011399 — 19.2468, 
0.2473]. 


Step-4: From (Kanji, 2006), the tabulated value is [0.546, 0.546] 
when «@ = 0.05 and ny € [12, 12]. 

Step-5: The computed RY") €[0.2468, 0.2473] is smaller than the 
critical value [0.546, 0.546]. Therefore, it is concluded that the 
suspected value in the sample is not an outlier. 





5. Discussion 


We presented the testing procedure of the proposed Dixon’s 
test in Section 4. We note from the calculations of the real data 
under the neutrosophic statistics; the proposed testing procedure 
provides the analysis of the data in the indeterminacy interval 
rather than the exact or determined values as in classical statistics. 
Chen et al. (2017a) and Chen et al. (2017b) suggested that a 
method which provides the values in the indeterminacy interval 
under uncertainty is said to be a more effective and adequate 
method. The neutrosophic form of the proposed test for the data 


is: RO") = 0.2468 + 0.2473Iy; Ive[0, 0.0020]. Note here that there 
is measure of indeterminacy 0.0020, which is associated with the 
proposed test under uncertainty. The analysis of the lifetime of 
the data suggested that the values of the proposed statistic can 
be from 0.2468 and 0.2473. On the other hand, the existing testing 
procedure given by Dixon (1950) and Dixon (1951) which is the 
special case of the proposed testing procedure provide the exact 
value of the statistic which is 0.2468. Furthermore, the existing test 
does not provide the measure of indeterminacy associated with the 
test. From this comparison, it is clear that the proposed testing pro- 
cedure is more adequate and effective to be used in uncertainty 
environment than the testing procedure given by Dixon (1950) 
and Dixon (1951) under classical statistics. 


6. Concluding remarks 


In this paper, we introduced a new Dixon’s test under the neu- 
trosophic statistics. We presented the testing procedure for the 


proposed test using the neutrosophic statistical interval method. 
The proposed testing procedure for the detection of the outlier in 
the data is more adequate and effective than the testing procedure 
available in the literature. The proposed testing procedure is more 
information to be applied when observations in the sample or the 
data are imprecise and fuzzy. The comparison between the pro- 
posed testing procedure and existing testing procedure support 
claim and coincides with the theory of Chen et al. (2017a) and 
Chen et al. (2017b). The proposed Dixon’s test under the neutro- 
sophic statistics is an effective alternative of the existing Dixon’s 
test under classical statistics. The proposed Dixon’s test statistic 
can be used to detect the outlier in the data for the better infer- 
ences from it. We will introduce the proposed Dixon’s test in the 
area of control chart theory as future research. 
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